A wave decoder for continuous speech recognition
نویسندگان
چکیده
In this paper, a wave decoder based on the general re-entrant network for continuous speech recognition is described. The decoder design is based on the concept of self-adjusting decoding graph in which the decoding network is expanded and released frame-synchronously. The fast network expansion and release are made possible by utilizing a novel dynamic network sca olding layer. The self-adjusting decoding graph is obtained by slicing the traditional decoding network horizontally for separation of di erent knowledge sources and vertically according to each time instant in search. A two layer hashing structure and an admissible arc predication scheme are described. These methods signi cantly reduce the arc mortality rate, a problemwhich plagues the e ciency of the dynamic decoder. Experimental results demonstrate that an order of magnitude reduction of decoding resources can be achieved based on the proposed approach.
منابع مشابه
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملAn Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model
The stack decoder is an attractive algorithm for controlling the acoustic and language model matching in a continuous speech recognizer. A previous paper described a near-optimal admissible Viterbi A* search algorithm for use with noncross-word acoustic models and no-grammar language models [16]. This paper extends this algorithm to include unigram language models and describes a modified versi...
متن کاملA tree-trellis n-best decoder for stochastic context-free grammars
In this paper a decoder for continuous speech recognition using stochastic context-free grammars is described. It forms the backbone of the ACE recognizer, which is a modular system for real-time speech recognition. A new rationale for automata is introduced, as well as a new model for pruning the search space.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996